An Efficient Binarization Technique for Historical Document Images
Abstract
Binarization is an important preprocessing step in many document image processing tasks. Binarizing historical documents with poor contrast, strong noise, and non-uniform illumination is a challenging problem. In this paper, we describe a new binarization algorithm developed to address this problem. Unlike histogram-based approaches, the method does not use statistical properties of the intensity histogram of a gray-scale image to determine a threshold. Instead, it follows a “divide and conquer” strategy that calculates the threshold value from the list of gray levels in the image. A low-pass Wiener filter is applied as a preprocessing step to enhance the image by deblurring it, and a median filter is applied as a postprocessing step to reduce noise. Our results show that this new binarization method produces higher-quality binary images for historical documents than other global methods such as Otsu’s method.
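The three stages named in the abstract (Wiener prefilter, threshold from the list of gray levels, median postfilter) can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the divide-and-conquer rule, so `dc_threshold` is a hypothetical stand-in that recursively halves the sorted list of distinct gray levels and averages the two sub-results. The Wiener step is the common adaptive local mean/variance form with a 3x3 window, which is also an assumption. All function names are invented for this sketch.

```python
from statistics import median_low


def neighborhood(img, r, c, k=1):
    """Values in the (2k+1)x(2k+1) window around (r, c), clipped at borders."""
    rows, cols = len(img), len(img[0])
    return [img[i][j]
            for i in range(max(0, r - k), min(rows, r + k + 1))
            for j in range(max(0, c - k), min(cols, c + k + 1))]


def wiener_filter(img, k=1):
    """Adaptive Wiener filter (assumed form): local mean/variance per window,
    noise power estimated as the mean of all local variances."""
    rows, cols = len(img), len(img[0])
    means = [[0.0] * cols for _ in range(rows)]
    variances = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            win = neighborhood(img, r, c, k)
            m = sum(win) / len(win)
            means[r][c] = m
            variances[r][c] = sum((v - m) ** 2 for v in win) / len(win)
    noise = sum(map(sum, variances)) / (rows * cols)
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            var = variances[r][c]
            if var <= noise:
                # Flat or noise-dominated region: fall back to the local mean.
                out[r][c] = means[r][c]
            else:
                gain = (var - noise) / var
                out[r][c] = means[r][c] + gain * (img[r][c] - means[r][c])
    return out


def dc_threshold(levels):
    """Hypothetical stand-in rule: split the sorted gray-level list in half,
    solve each half recursively, and average the two sub-results."""
    if len(levels) == 1:
        return float(levels[0])
    mid = len(levels) // 2
    return (dc_threshold(levels[:mid]) + dc_threshold(levels[mid:])) / 2


def binarize(img):
    """Wiener prefilter -> threshold from distinct gray levels -> 3x3 median."""
    smoothed = wiener_filter(img)
    # Only the set of gray levels is used, not how often each one occurs.
    levels = sorted({round(v) for row in smoothed for v in row})
    t = dc_threshold(levels)
    binary = [[1 if v > t else 0 for v in row] for row in smoothed]
    # median_low always returns an actual 0 or 1, even for even-sized
    # border windows.
    return [[median_low(neighborhood(binary, r, c))
             for c in range(len(binary[0]))]
            for r in range(len(binary))]
```

Calling `binarize` on a gray-scale image stored as a list of rows returns an image of the same shape in which 1 marks bright (background) pixels and 0 marks dark (ink) pixels. Because the stand-in rule looks only at which gray levels occur, it ignores histogram counts, in the spirit of the abstract's description.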
Similar resources
A Novel Approach for Word Retrieval from Devanagari Document Images
A large amount of information lies dormant in historical documents and manuscripts. This information would be lost if not stored in digital form. Searching for relevant information in these scanned images would ideally require converting the document images to text by optical character recognition (OCR). For indigenous scripts of India, there are very few OCRs that can succe...
Restoration of Degraded Historical Document Image: An Adaptive Multilayer-Information Binarization Technique
Binary image is the essential format for document image processing, and the operation of the subsequent steps depends on the quality of the binarization process. The objective of this research is to propose a new binarization method based on adaptive multilayer-information for restoration of degraded historical document images. This paper focuses on degraded Thai historical document images, whi...
Blur Detection for Historical Document Images
FamilySearch captures millions of digital images annually using digital cameras at sites throughout the world. The top image quality problem encountered during this image capture process is blurriness due to an out of focus camera and/or motion during capture. Several automatic measurements of digital image blur exist, but have been found to not be very accurate at correctly classifying blurry ...
A Hybrid Binarization Technique for Document Images
In this chapter, a binarization technique specifically designed for historical document images is presented. Existing binarization techniques focus either on finding an appropriate global threshold or adapting a local threshold for each area in order to remove smear, strains, uneven illumination etc. Here, a hybrid approach is presented that first applies a global thresholding technique and, th...
An Enhancement of Images Using Recursive Adaptive Gamma Correction
The “Adaptive Approach for Historical or Degraded Document Binarization” addresses the large collections of ancient historical documents, printed or handwritten in native languages, that libraries and museums hold. Typically, only a small group of people is allowed access to such collections, as the preservation of the material is of great concern. In recent years, libraries have begun to digiti...